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Abstract 



. We determine the uncertainty on the strong coupUng as due to the experimental 

. errors on the data fitted in global analysis of hard-scattering data, within the standard 

, framework of leading-twist fixed-order collinear factorisation in the MS scheme, finding 

O ; that as{M^) = 0.1202+°:°[]J2 next-to-leading order (NLO) and as(M|) = 0.117ll[j:[]|]J^ 

I at next-to-next-to- leading order (NNLO). We do not address in detail the issue of the 

additional theory uncertainty on as (Af|), but an estimate is ±0.003 at NLO and at 



most ±0.002 at NNLO. We investigate the interplay between uncertainties on as and 
uncertainties on parton distribution functions (PDFs). We show, for the first time, how 
both these sources of uncertainty can be accounted for simultaneously in calculations of 
cross sections, and we provide eigenvector PDF sets with different fixed as values to 
allow further studies by the general user. We illustrate the application of these PDF sets 
by calculating cross sections for W, Z, Higgs boson and inclusive jet production at the 
Tevatron and LHC. 



1 Introduction 



There has been a steady improvement in the precision and the variety of the data for deep- 
inelastic and related hard-scattering processes. Global parton analyses of these data allow the 
partonic structure of the proton to be quantified with ever-increasing accuracy. Analyses are 
now possible at next-to-next-to- leading order (NNLO) in the running strong coupling asiQ"^)- 
Indeed, the accuracy is such that the theoretical formalism needs to consider effects at the level 
of 1%. One area which needs careful treatment at this level of accuracy is the strong coupling 
itself. This applies both to the definition of the coupling, and to the uncertainties associated 
with it. The definition of 0:5 is not unique and different definitions can give noticeable differences 
in the results. In the recent "MSTW 2008" global fit [1] we changed the definition of the strong 
coupling compared to that used in previous "MRST" analyses. We discuss the technical details 
of this change in detail in the Appendix. However, the main emphasis of this article is the 
issue of the uncertainty on parton distribution functions (PDFs) and derived quantities such as 
cross sections, arising from the uncertainty in the strong coupling as- This is a non-trivial, and 
sometimes significant, source of uncertainty which is often not considered when extracting PDFs 
from global fits or when calculating uncertainties on hadronic cross sections^ The interplay 
of the PDFs, their uncertainties and as is both interesting in itself and important for making 
precise cross section predictions, and in this paper we give the first comprehensive account of 
this subject. 



2 Parton distributions and the strong coupling 

The recent MSTW analysis [1] (using an improved definition of as) was based on the obtained 
"best-fit" values of the strong coupling, i.e. the values obtained by minimisation of a global 
goodness-of-fit quantity, Xgiobah simultaneously with the PDF parameters at an input scale 
Ql = l GeV^. In the NLO and NNLO global PDF fits, the values were found to be as{Ml) = 
0.1202 and 0.1171, respectively. In addition to the best-fit PDFs and corresponding best-fit 
as values, we also determined a set of eigenvector ("error") PDF sets, designed to span the 
variation in the parton distributions allowed by the uncertainties on the data included in the 
global fit. We introduced and used a new "dynamic tolerance" method for determining 68% or 
90% confidence- level (CL.) error PDFs, which is an extension of an earlier method introduced 
in the CTEQ6 analysis [4-6] . Then the prediction and corresponding uncertainty for a physical 
quantity F that depends on the PDFs, such as a cross section at a hadron-hadron collider, is 

^See, for example, Refs. [2,3] for analyses where the uncertainty on as is included in the theoretical uncer- 
tainty on predicted cross sections. 
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Here, is the central PDF set, and (k = l,...,n) are the eigenvector PDF sets, with 
n = 20 in the MSTW 2008 analysis [1]. To illustrate our method and its results, we used as 
examples in Ref. [1] the total cross sections for and Z production at the Tevatron and LHC, 
i.e. we computed the theoretical predictions for the total cross sections and their corresponding 
PDF uncertainties, a ± Adpop. 

However, the PDF sets obtained in this previous study are only defined for the particular 
values of as'(Ml) found by the best fit. If the user has a preferred (different) value of asiM"^) 
then PDFs determined with that particular value in the global analysis should strictly be 
used. Hence, in this paper we provide best-fit PDFs for a range of values of as'(Ml). This 
is a straightforward matter of repetition. However, when considering uncertainties on PDFs 
and on physical quantities the process is more involved. The true theoretical uncertainty on 
a predicted cross section should also include a contribution from the allowed variation of as 
about its best-fit value. This is particularly important for cross sections that at leading-order 
are proportional to a power of the couphng, for example, cniggs oc a| for production via gluon- 
gluon fusion at the Tevatron and LHC. A naive way of doing this would be to define the overall 
theory error as Aa^^ = AcTppp + Aa^^, where Acr^g is the cross section variation when as is 
allowed to vary within some range, determined, for example, by its world-average error, for a 
fixed set of PDFs. However, it is not consistent to use different values of as in the partonic 
cross section describing the hard subprocess and in the PDF evolution. Moreover, in a global 
PDF analysis, there are non-negligible correlations between the PDFs and the value of the 
strong coupling. For example, the evolution of the structure function F2 at small x is driven 
by the combination as g, and so a large value of the coupling will correspond, in principle, to 
a smaller gluon distribution at low x. This is indeed observed in practice. 

This PDF-a5 correlation has been known for some time, and indeed early global analyses 
derived a series of best-fit PDFs for a set of discrete as values spanning a given range, for 
example, asi^M"^) = 0.105, 0.110, . . . , 0.130 [7]. Other relevant studies have been made by the 
CTEQ group [2,8]. However, given that the precision of the data used in global analyses has 
increased significantly in recent years, with a corresponding reduction in the error on both 
the PDFs and on as, a more sophisticated approach is now required, that is, one in which 
(i) the allowed variation of as is determined by the data, and (ii) the correlation between the 
variation in and in the PDFs is fully taken into account. The latter would be very difficult 
to achieve if 05 (M|) and its uncertainty were taken as externally determined quantities rather 
than obtained directly from the fit. 

^In practice, we use slightly more precise formulae, given in Eqs. (51,52) of Ref. [1], leading to asymmetric 
PDF uncertainties; see also Eqs. ([7]) and ([8]) below. 
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In this paper we investigate the interplay between uncertainties on as and uncertainties on 
parton distributions in the framework of the recent MSTW 2008 analysis [1] . We show how both 
these sources of uncertainty can be properly accounted for simultaneously in calculations of cross 
sections. The remainder of this paper is organised as follows. In Section [3] we discuss how the 
description of each data set included in the global fit depends on the value of as- In Section H] we 
determine the ila (that is, the 68% C.LJ and 90% C.L. uncertainties on as{M'^) as determined 
in the NLO and NNLO global analysesjfl Moreover, we identify which particular data sets give 
the strongest constraints on the value of a5(M|). In Section [5] we derive eigenvector PDF 
sets with a5(M|) fixed at the limits of the 68% (and also 90%) C.L. uncertainty region. In 
Section [6] we illustrate our results using predictions for the W^, Z, Higgs and inclusive jet cross 
sections at the Tevatron and LHC, i.e. we derive uncertainties on these cross section predictions 
which take both the allowed variation of and the PDFs fully into account. We conclude in 
Section [71 Finally, we devote an Appendix to a discussion of the definition of as used in our 
recent MSTW 2008 analysis [1], and to a comparison with the earlier form used in the MRST 
analyses. The importance of a consistent definition is especially important at NNLO, where, 
in the MS scheme, the coupling asiQ"^) is discontinuous, albeit by a very small amount, as the 
scale increases through the heavy flavour thresholds. 



3 Description of data sets as a function of as 

The MSTW 2008 global fit [1] used a wide variety of data from both fixed-target experiments 
and the HERA ep and Tevatron pp colliders. Neutral-current structure functions {F2 and 
Fi) were included from fixed-target lepton-nucleon scattering experiments (BCDMS [9, 10], 
NMC [11,12], E665 [13] and SLAC [14-16]), low-mass Drell-Yan cross sections from the 
E866/NuSea experiment [17,18], and charged-current structure functions {F2 and 0:^3) and 
dimuon cross sections from neutrino-nucleon scattering experiments (CCFR/NuTeV [19, 20] 
and CHORUS [21]). From the HERA experiments, HI and ZEUS, data were included on 
neutral- and charged-current reduced cross sections (a^^ and o"^^) [22-30], the charm struc- 
ture function (iT'charm^ [31-37], and inclusive jet production in deep- inelastic scattering [38-40]. 
From the Tevatron experiments, CDF and D0, Run II data were included on inclusive jet 
production [41,42], the lepton charge asymmetry from W decays [43,44] and the Z rapidity 
distribution [45,46]. A more detailed description of the treatment of each of these data sets 
can be found in Ref. [1]. 

The definition of the goodness-of-fit quantity, xln fo^' each data set n is given in Section 5.2 
of Ref. [1] . Note that the normalisation of each data set in the global analysis is taken as a fitted 
free parameter to allow for uncertainties in the measured luminosities, with a quartic Xn penalty 

•^Note that we do not consider LO fits in the current analysis, since the overall quality of the LO global fit 
of Xgiobai — 3066 for 2598 data points is significantly worse than those at NLO and NNLO, and therefore there 
is no statistically meaningful constraint on as at this order. As discussed in Ref. [1], the large best-fit value of 
as(Af§) = 0.13939 obtained is an attempt to mimic missing higher-order terms. 
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term given by Eq. (37) of Ref. [1] rather than the more usual quadratic penalty term, in order to 
bind the fitted data set normalisations more closely to their nominal values of 1, and to recognise 
that the normalisation uncertainties may well not be Gaussian in nature. For most data sets, 
particularly the older ones where correlations are unavailable or statistical uncertainties often 
dominate, the statistical and systematic (other than normalisation) uncertainties are simply 
added in quadrature in the definition of Xn- However, for the data on inclusive jet production, 
where at the Tevatron the systematic errors are often many times larger than the statistical 
errors, and the CDF Z rapidity distribution, the full information on correlated systematic 
uncertainties is included by defining Xn ^'i- (38) of Ref. [1], i.e. the data points are 

allowed to shift by the correlated systematic uncertainties in order to give the best fit, with a 
quadratic penalty term to limit large shifts by more than the experimental errors. 

We plot the Xn profiles for each data set n as the difference from the value at the global 
minimum, Xnc when varying as{M^)] see Figs. [THUfor the NNLO Xn profiles for each of the 
29 different types of data. The points (•) in Figs. [THl] are generated for fixed values of a5(M|) 
between 0.107 and 0.127 in steps of 0.001. These points are then fitted to a quadratic function 
of q;5(M|) shown by the solid curves. The horizontal dashed lines in the plots indicate the 68% 
and 90% C.L. limits for each data set, determined according to a "hypothesis-testing" criterion 
which we will describe below in Section HJ 

We see from Figs. [THl] that for most data sets the variation of Xn with respect to asi^M^) is 
indeed approximately quadratic, with minima in the vicinity of the value of as{M^) determined 
by the global fit. However, a few data sets have minima which are significantly displaced from 
the global best-fit a5(M|) values of 0.1171 at NNLO (or 0.1202 at NLO). For example, the 
BCDMS data on F^^^''^^ prefer a5(M|) values around 0.110. With the global best-fit as, in 
the region 0.3 < a; < 0.7 falls too quickly with increasing 

Q2 for the BCDMS data, particularly 
for low (and hence low inelasticity y), and so a lower 0:5 is preferred since it gives a flatter 

slopeo By contrast, the NMC Fl^^^''''^ data sets prefer as values somewhat larger than the 
global average, since the increase of these data with is quicker than the default fit at NLO 
for X < 0.1, so a bigger as increases the evolution speed (as do NNLO corrections, hence this 
problem is reduced at NNLO). Peculiarly, the E665 data, particularly F!^'^ , show the opposite 
trend to the NMC data in the same region of x, and therefore the E665 data prefer low as 
values at NNLO (with the NLO fit giving a better description). Note also from Fig. [3] that 
within the full global fit the HI and ZEUS cr^*" data (mostly at small x) have minima at slightly 
high values of asiM"^) f» 0.121, implying that slightly stronger evolution than with the overall 
best fit is preferred. A similar relative feature is found at NLO. The slight tension between the 

slopes of the structure functions from various experiments and the overall best-fit theory 
predictions can be seen clearly in Fig. 24 of Ref. [1] (see also Figs. 3-6 of Ref. [47]). 

We find that the preference of the inclusive HERA data for high as values holds also when we 
perform a NLO fit to these data alone, which at first sight seems to contradict previous findings 

■^The low y data points from BCDMS are strongly affected by the energy scale uncertainty of the scattered 
muon. It has been advocated to impose a cut of y > 0.3 on the BCDMS data, which caused as{M'^) to increase 
by about 0.004 in a fit to only BCDMS data and by about 0.002 in a combined fit to HI and BCDMS data [23]. 
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Figure 1: profiles for tlie first subset of data sets n in tlie NNLO fit, wlien varying as{M\). 
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MSTW 2008 NNLO (aJ PDF fit 
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Figure 2: Xn profiles for tlie second subset of data sets n in tlie NNLO fit, wlien varying as{Mz). 
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MSTW 2008 NNLO (aJ PDF fit 
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MSTW 2008 NNLO (aJ PDF fit 
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by HI [23] and ZEUS [48] indicating a preference for low values of 0.115 and 0.110, respectively, 
although each with a large uncertainty, when fitting only to inclusive data. Relatively low values 
of 05 are also preferred by the inclusive HERA 96/97 data sets in the CTEQ global analysis (see 
Fig. 4 of Ref. [2]). However, the low-x gluon distribution is strongly anticorrelated with as- The 
input gluon parameterisation in the HI [23] , ZEUS [48] and CTEQ [2] analyses was assumed to 
have the form xg ~ x^<^ at low x, whereas the more flexible MSTW [1] (and MRST [47]) input 
gluon parameterisation is effectively a sum of two powers at low x, where the second term is 
required by the fit to be negative, at the input scale = 1 GeV^. We find that a NLO fit 
to only inclusive HERA data using a restricted input gluon parameterisation, with the second 
term omitted, gives as{M'^) ~ 0.110 (with a large uncertainty). Repeating the global fits gives 
as{Ml) = 0.1175 at NLO and as(M|) = 0.1157 at NNLO. However, the global is worse by 
63 at NLO and by 80 at NNLO, so the more flexible low-x gluon parameterisation is clearly 
required (particularly for the PDF uncertainties, see Section 6.5 of Ref. [1]). Since the CTEQ 
PDFs are input at the slightly higher Qq = 1.69 GeV^, and other sets have even higher input 
scales, the effect of only having a single power for the gluon parameterisation will be reduced 
compared to our above example. However, the general feature of a reduction in as{M^) due 
to a restricted parameterisation is very likely to persist to a greater or lesser extent. 

The data set that exhibits particularly anomalous behaviour, where Xn significantly re- 
duced for larger a5(M|), is the D0 Run 11 W ^ fiu charge asymmetry; see Fig. HI We have 
already pointed out the difficulties of fitting these data in the standard NLO and NNLO global 
analyses (see Section 11.1 and Fig. 44 in Ref. [1]), with both fits yielding a Xno 25 for 10 
data points. We also noted that the asymmetry is sensitive to the separation into valence and 
sea quarks, particularly at lower lepton px- Indeed, this explains the behaviour of xi with 
q;5(M|): as as{M'^) increases, the valence quarks evolve more rapidly at high x than the sea 
quarks, and the W asymmetry is reduced. For as{Mz) — 0.126 at NNLO (or ~ 0.129 at NLO) 
the decrease in Xn is approximately 15, and a good fit is obtained. However, such large as^M"^) 
values are completely inconsistent with most of the other data sets in the global analysis (and 
with other determinations of 0:5). 

The NLO Xn profiles are similar to those at NNLO for most data sets. In Fig. [S] we 
compare the NLO and NNLO Xn profiles for those data sets where there are notable differences 
between the two fits. Specifically, the Xn profiles for the HI and ZEUS F^'^^^™ data, the 
NMC/BCDMS/SLAC Fl data and the E866/NuSea pp Drell-Yan cross sections are clearly 
more quadratic at NNLO than at NLO, with minima closer to the best-fit values. This indicates 
a strong preference for the NNLO description, which is not so apparent if only the global best-fit 
values Xnfi known. Let us give the reasons for this, taking each of these data sets in turn. 
The Xn minimum for the _P|i^'*''™ data at NNLO is consistent with the global average value of 
05 (M|), whereas a minimum is not even visible in the NLO plot. It was shown in Ref. [49] that 
at NNLO the value of dF2^'^^"^ /dlnQ'^ is lowered compared to the NLO value. The preference 
of the F2"harm ^^^^^ ^^lis is borne out by comparing both the values of Xn each order and 
the shape of the profile, i.e. the NLO fit tries to flatten the slope with a lower coupling. The 
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NNLO coefficient functions for Q^) [50,51] are positive and significant, and similarly the 

NLO fit tries to mimic these with a higher value of as{M'^). 

The explanation of the different NLO and NNLO proffies for the E866/NuSea pp Drell-Yan 
cross sections, seen in Fig. El is more complicated. The positive NNLO correction [52] requires 
an increased E866/NuSea normalisation (1.09 at NNLO compared to 1.01 at NLO, cf. the 1-a 
normalisation uncertainty of 6.5%), with a correspondingly larger Xn penalty (3.2 at NNLO 
compared to practically zero at NLO). When varying ^^(Mf ) in the range shown in Fig. [5l the 
Xn penalty term due to the fitted normalisation of the Drell-Yan data set is negligible at NLO 
(< 1 unit), while it increases dramatically at NNLO when increasing as (Ml;) from the best-fit 
value, reaching 22 units when a5(M|) = 0.127, i.e. a sizeable proportion of the total increase 
in Xn relative to the best-fit as- The Drell-Yan data slightly prefer a higher value of as at 
NLO than the global best-fit value, while at NNLO a lower value of 0:5 is preferred mainly in 
order to reduce the Xn penalty term due to the fitted normalisation. 

The complete NNLO corrections are not yet known for inclusive jet production in deep- 
inelastic scattering or in hadron-hadron collisions. For the Tevatron data, we use the approxi- 
mation to the NNLO corrections obtained from threshold resummation [53], which is included 
in the fastnlo package [54]. No such approximation is available for the HERA data, and so 
the HERA data on inclusive jet production are simply omitted from the NNLO global fitjfl 
More discussion of these issues is given in Ref. [1]. 

For completeness, in Fig. [6](a) we show the Xn profiles for the HERA inclusive jet production 
data in the NLO fit, since these HI and ZEUS data sets are not included in the NNLO fit. 
Although the impact on the high-, gluon distribution i. diktej in a global parton analysis, 
it is seen from Fig. [6] that these data sets are quite sensitive to the value of 0:5, with the HI 
data preferring as^M^) — 0.113 and the ZEUS data preferring q;s(M|) ~ 0.118. We also show 
in Fig. [6](b) the corresponding NLO plots for the Tevatron inclusive jet production data: the 
preferred values of ^^(Ml) are 0.120 for the CDF data (fc-r jet algorithm) and 0.119 for the 
D0 data (cone jet algorithm), which are very consistent with the best-fit values. 

We provide public PDF grids (without uncertainties) for fits with as{M'^) in the range 0.110 
to 0.130 (at NLO) and 0.107 to 0.127 (at NNLO) in steps of 0.001 [56]. These grids will allow 
users to make 0:5 determinations from other data sets. Note that we have implicitly done this 
for each of the individual data sets in Figs. [TH6|, where the values of as{M^) at the minima 
can be read off from the plots, together with an experimental uncertainty for an appropriate 
choice of Axn = xi ~ xi min- (Generally, the 1-a experimental uncertainty is taken to be the 

^Recent progress in the NNLO calculation of inclusive jet production in deep-inelastic scattering has been 
made by the derivation of the relevant two-loop QCD helicity amplitudes [55]. 

^In a restricted fit comprising only ZEUS data on tr^^ and cr^'"', the addition of jet cross sections from 
deep-inelastic scattering and photoproduction was found to significantly decrease the mid- to high- a; gluon 
uncertainty without changing the central value [48]. However, within a global fit [1] constraints are placed on 
the high-x gluon not only by the Tevatron inclusive jet data, but also by fixed-target structure functions, such 
that the impact of HERA inclusive jet cross sections is reduced. 
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Figure 6: Xn profiles for tlie (a) HERA and (b) Tevatron inclusive jet production data at NLO. 
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change in 05 (M|) which gives an increase of one unit in Xn with respect to the minimum 
value.) However, more detailed studies may be desirable, for example, including a theoretical 
uncertainty in the as{M^) determination from the observed renormalisation scale dependence. 
Moreover, as extractions from other quantities may be of interest, for example, from the ratio 
of three-jet to two-jet rates at the Tevatron. 

4 Experimental uncertainty on as{M'^) 

Ideally, we would expect the errors on as{Ml) to be given by Axgiobai = Xgiobai " Xmin = 
1 or 2.71 for a 68% or 90% CL. limit respectively. However, in practice, there are some 
inconsistencies between the independent fitted data sets, so these "parameter-fitting" criteria 
are not appropriate for global PDF analyses. Instead, we follow the procedure of Section 6.2 
of Ref. [1], where we described how to choose an appropriate value of the tolerance T = 
(AXg[Q|_,3^j)^/^ for each eigenvector of the covariance matrix according to "hypothesis-testing" 
criteria. Here, we will use the same method to determine the appropriate uncertainty on 
a5(M|). To summarise, we perform the following steps. 

We define the 90% CL. region for each data set n (comprising data points) by the 
condition that [1] 



where ^90 is the 90th percentile of the x^-distribution with degrees of freedom, and ^50 — 
is the most probable value. (These quantities are defined in detail in Section 6.2 of Ref. [1]; 
see also the example discussed around Eq. ([6]) below.) Similarly for the 68% CL. region. The 
90% and 68% CL. regions determined in this way are shown as the horizontal dashed lines 
in Figs. [IHni We then record the values of 05 (M|) for which the Xn for each data set n are 
minimised, together with the 90% and 68% CL. limits defined by the intercepts of the quadratic 
curves with the horizontal dashed lines in Figs. [THHl 

In Fig. [7]the points (•) indicate these values of as{M'^) for which Xn is minimised, while the 
inner error bars extend across the 68% CL. region and the outer error bars extend across the 
90% CL. region defined by Eq. ([2]). Note that inclusive jet production at the Tevatron is the 
only process included in the global fit which is proportional to a| at leading-order, therefore 
these data sets provide a strong constraint on as- However, it might initially be considered 
surprising that the error bars in Fig. [7] are much smaller for CDF than for D0. Both factors 
on the right-hand side of Eq. ([2]) are smaller for CDF than for D0, meaning that the 90% 
CL. region for Xn ~ Xno; indicated by the horizontal dashed lines in Fig. Mjo) at NLO (and 
the corresponding plots in Figs. [3] and H] at NNLO), is smaller by almost a factor of 2 for 
CDF. More importantly, the Xn profiles are significantly steeper for CDF compared to D0, 
i.e. the CDF data are more sensitive to the as value. This can largely be explained by the fact 
that the normalisation of the CDF jet data is tied to the more constraining CDF Z rapidity 
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Figure 7: Ranges of as{M'^) for which data sets are described within their 90% C.L. hmit 
(outer error bars) or 68% C.L. hmit (inner error bars) in the (a) NLO and (b) NNLO global 
fits. The points (•) indicate the values of as{M^) favoured by each individual data set n, 
that is, the values for which Xn minimised. The uncertainty on a5(M|), indicated by the 
horizontal dashed lines, is chosen to ensure that all data sets are described within their 68% or 
90% C.L. limits defined by Eq. 
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distribution, due to the common luminosity uncertainty. This is an interesting example of the 
interplay between data sets in a global fit. The normalisation of the D0 jet data is independent 
of the D0 Z data, since the latter is presented as a rapidity shape distribution, i.e. divided by 
the measured Z total cross section. The D0 jet data therefore has more freedom to compensate 
for the variation in as by changing the normalisation. 

We choose the uncertainty on asiM'z)-, indicated by the horizontal dashed lines in Fig. [TJ 
so that all data sets are described within their 90% or 68% C.L. limits. It is seen from Fig. [7] 
that the upper limit on as'(Ml) is fixed by the deterioration of the quality of the fit to the 
BCDMS F^^ data (within the context of the full global fit), while the lower limit is provided 
by the deterioration of the quality of the fit to the SLAG F^'^ data. In each case a number of 
additional data sets are very close to providing the constraint, i.e. as with the determination 
of the tolerance values in Ref. [1] there is no particular reliance on any one data set in fixing 
the limits on asiM'z)- The final results are: 

NLO: a5(M|) = 0.1202 (68% C.L.) (90% C.L.), (3) 

NNLO: as{Ml) = 0.1171 l^ou (68% C.L.) t'ofolt (90% C.L.). (4) 

In Fig. [S]we show the change in Xgio^ai a5(M|) is varied. The vertical lines indicate the 68% 
and 90% C.L. uncertainties given by Eqs. ([3]) and (jl]). The uncertainties on as{M^) at NNLO, 
determined using the procedure just described, amount to taking Axgi^j^j^j =~^l'^ (68% C.L.) or 
lill (90% C.L.). These are much larger values than the canonical "parameter-fitting" values of 
Axgiobai = 1 (68% C.L.) or 2.71 (90% C.L.). Conversely, if these standard "parameter-fitting" 
values of Axgi^j^g^i were taken seriously, then the uncertainties on a5(M|) would be ^aooog (68% 
C.L.) or ^aoon (90% C.L.). The experimental errors on 0:5 given in Eqs. ([3]) and (jlj) are a much 
refined revision of the previous estimate of ±0.002 from the MRST 2001 analysis [47] obtained 
using a fixed Axgiobai = 20. 

In Table [T] we compare our determination of ^^(Ml) at NLO and NNLO with the results 
obtained by other PDF fitting groups, showing only the experimental uncertainties. The values 
in Table [1] can be compared with the world average value quoted by the Particle Data Group 
(PDG) of as{Ml) = 0.1176±0.002 [63], where the PDG world average is dominated by NNLO 
results, with the NLO results included carrying little weight. The 2009 world average value of 
a5(M|) obtained by Bethke [64] is 0.1184±0.0007. The MSTW values are completely consistent 
with the world average values, but are a little higher than all the others in Table [U which are 
mostly from fits mainly, or exclusively, to structure functions in deep-inelastic scattering. The 
NNLO values in Table [1] are generally smaller than the corresponding NLO values from the 
same fitting group, as is to be expected since NNLO corrections to splitting functions and 
coefficient functions automatically lead to quicker evolution. The CTEcjll global analysis [2] 
includes additional data on fixed-target Drell-Yan production and Tevatron data on the W — >■ 

^Thc CTEQ analysis uses a definition of as at NLO which at low scales lies above other definitions for the 
same value of as(M^), and tends to result in an extraction of as{M'^) about 0.001 lower than the same fit 
using the other definitions, see Figs. 8 and 9 of Ref. [8]. 
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NLO 


0:5 (M|) (expt. unc. only) 


MSTW (this work) 
CTEQ [2] 
HI f23l 

J- J- -L 

ZEUS [48] 
Alekhin [57] 
BBG [58] 
GJR [59] 


1202 

U.iZUZ _o.0015 

0.1170 ±0.0047 
11 50 ±0 0017 
0.1183 ±0.0028 
0.1171 ±0.0015 
0.1148 ±0.0019 
0.1145 ±0.0018 




NNLO 


as{M'^) (expt. unc. only) 


MSTW (this work) 
AMP [60] 
BBG [58] 
ABKM [61] 
JR [62] 


1171 +°-°°^^ 

U.l±(± _0.0014 

0.1128 ±0.0015 

n 1 1 34 +0-0019 

U.iiO^ -0.0021 

0.1129 ±0.0014 
0.1158 ±0.0035 



Table 1: Comparison of the present as determination at NLO and NNLO with the values 
obtained by other PDF fitting groups, showing only the experimental uncertainties. 

iu asymmetry and inclusive jet production. The HI analysis [23] includes additional BCDMS 
data, the ZEUS analysis [48] includes jet cross sections from deep-inelastic scattering and 
photoproduction, the GJR analysis [59] includes E866/NuSea Drell-Yan and Tevatron inclusive 
jet data, and the AMP [60], ABKM [61] and JR [62] analyses all include fixed-target Drell-Yan 
data. The BBG fits [58] are from a non-singlet analysis. We quote the results of the "standard" 
fits by GJR/JR [59,62] and not the results of the "dynamical" fits using a more restricted input 
parameterisation, which were the focus of these analyses but gave a slightly worse description 
of data, with a5(M|) values lower by around 0.003 than the values from the corresponding 
"standard" fits (and with smaller uncertainties due to the significant theoretical constraint on 
the input). This is another, rather extreme, example of the point raised in Section [3l i.e. the 
correlation between a restricted input parameterisation for the gluon distribution and low values 
of ^^(Ml). Indeed, this is very likely to be part of the reason for the lower values of all the 
extractions in Table [1] compared to MSTW. The exception is BBG [58], which is a non-singlet 
analysis, and by definition relies on a much smaller data set and on the data cuts being sufficient 
that purely non-singlet evolution is applicable. 

Indeed, there is a distinct sensitivity to the data sets used. The fits containing only a limited 
number of data sets tend to obtain lower values of as- This is often because the BCDMS data 
has a strong influence in the fit, as seen in Fig. [71 This tendency for the BCDMS data to bring 
the value of ^^(Ml) down was well-illustrated in the HERA-LHC "benchmark" fits [65,66], 
where both the data sets chosen and the cuts applied led to the BCDMS data carrying a high 
weight. It is often thought that the inclusive jet data favour high values of a5(M|), and are 
thus responsible for raising the values significantly in global fits. The picture is not quite so 
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simple, as the values preferred by these data sets shown in Fig. [7] illustrate, with none lying far 
above the global best-fit values. Indeed, in Section 12 of Ref. [1] it was shown that removing 
the Tevatron jet data results in the best-fit value of a5(M|) at NLO falling only from 0.1202 
to 0.1197. It is more the case that the inclusion of the jet data can alter the shape of the gluon 
distribution obtained in a fit to only deep-inelastic scattering data, which indirectly affects the 
value of a5(M|), or can turn a low value of as(M|) with large uncertainty into a much better 
constrained higher value, as in Ref. [48]. 

In addition to the data fitted, the differences seen in Table [T] between the experimental 
uncertainties on as(M|) obtained by different groups can be traced partly to the different 
methods used for error propagation and in particular to the choice of the tolerance, T = 
(Axgj^j^g^i)^/^. For example, the large CTEQ uncertainty corresponds to T ~ a/100/2.71 ~ 
6 compared to the MSTW values of T ~ 2-3, GJR/JR use T ~ 4.7/4.5 at NLO/NNLO 
respectively, while the other groups fit a smaller range of data and use T = 1. 

We shall not address in detail the issue of the theory uncertainty on our as determination. 
The widely-used method of varying the renormalisation and factorisation scales up and down 
by a factor of around two is not really sufficient in a global fit, as it misses contributions 
depending on ln(l/a;) and ln(l — x) at higher orders and such issues as ambiguities in heavy 
quark flavour scheme deflnitionsjfl Alekhin [57, 67] has estimated the theory uncertainty on 
as{Mz) determined from flts to deep- inelastic scattering data due to scale dependence to be 
±0.0033 at NLO and ±0.0009 at NNLO. HI [23] estimated a similar uncertainty on their 
as{M'^) determination from HI and BCDMS proton data to be ±0.005, and ZEUS [48] also 
obtained ±0.005 from varying the scale only for the jet data included in their flt. 

A simple estimate of the theory uncertainty can be obtained by taking the difference be- 
tween the NLO and NNLO determinations of a5(M|). This gives ±0.003, which could be 
considered to be a minimum theory uncertainty at NLO and a maximum theory uncertainty at 
NNLO. It coincides with the estimate at NLO quoted in the MRST 2001 analysis [47] and the 
estimate at NLO obtained from scale variation by Alekhin [67]. It is also of the same order as 
changes invoked by introducing models of higher-order corrections or changing the cuts on data 
at NLO [68], which to us seems to be one of the most reliable estimates. It is probably an over- 
estimate at NNLO and more quantitative studies are needed. However, at NNLO the estimates 
from changing data cuts and introducing models for theoretical corrections [68] suggested that 
variations in as{M'^) are distinctly lower than at NLO, and that perhaps ±0.001-0.002 would 
be more appropriate. Hence, to be conservative we estimate a NNLO theoretical uncertainty 
of at most ±0.002, a slightly larger value than that obtained from scale variation at NNLO by 
Alekhin [67] to account for the more complicated global flt. 

^Additionally, scale variation is not currently possible for our fully global fit due to various complications, 
including the general-mass variable flavour number scheme [49]. However, as one example, if the scale is changed 
only for inclusive jet production at the Tevatron, from = = px to /ijj = — Pt/'^, then as(Af|) is 
lowered by 0.001, at the expense of a slightly worse description of the D0 Run II data, as discussed in Section 
12 of Ref. [1]. 
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5 Eigenvector PDF sets with varying as 



Following on from our method of determining PDF uncertainties using the Hessian method, 
we might consider simply letting 05 (M|) be another free parameter when diagonalising the 
covariance matrix of the fit, resulting in 42 rather than 40 eigenvector PDF sets, each with a 
slightly different value of as- We do not follow this approach for various reasons. The first 
of these is because the coupling does not quite sit on an equal footing with the input PDF 
parameters. Let us consider a superposition of different eigenvector PDF sets. The linear 
nature of the DGLAP evolution equations means that any linear combination of PDFs, all 
with a common value of a;5(M|), is also a well defined PDF set evolving precisely according 
to the DGLAP evolution equations. Consequently, the precise linear combination is the same 
whatever the factorisation scale at which we sample the PDFs. But if the eigenvector PDF 
sets in the superposition have differing as{M^), then this picture for linear combinations is not 
preserved. A linear combination of PDFs evolving with different as{M'^) values do not evolve 
in precisely the same manner as one single PDF set with a fixed as, i.e. they do not follow 
precisely a real trajectory in PDF space. This is not a problem in the particular case of using 
the eigenvector PDF sets to map out the uncertainty band, since each is used independently. 
However, it is an issue when rediagonalising the Hessian matrix, such as in Ref. [69], where, 
from a well-defined starting point, the new eigenvector PDF sets would not correspond to a 
particular PDF set which evolves in precisely the manner prescribed by the evolution equations. 
Since the whole Hessian method is based on manipulating linear combinations of perturbations 
in PDF parameters, which to first order in the Taylor expansion is equivalent to manipulating 
linear combinations of PDF sets, we find this feature troubling, even though it would certainly 
be a small effect in practice. 

There are also more practical reasons why we reject the option of simply including as as an 
extra parameter in the Hessian matrix. We limit ourselves to 20 eigenvectors because a larger 
number of free input PDF parameters leads to too large correlations, and to a breakdown of 
the quadratic behaviour of the global distribution in some eigenvector directions [1]. A 
free as would introduce more correlation between parameters, and reduce the stability of the 
eigenvectors, a feature we prefer to avoid. Finally, this approach would limit the uncertainty 
analysis to an expansion about the single best-fit PDF set, with a unique value of 0:5. We 
acknowledge that the user may prefer more flexibility than this, and might wish to utilise PDF 
sets including uncertainties for a variety of different values of as- Hence, we provide eigenvector 
PDF sets with different, but fixed, as values to allow studies of this sort. In the rest of this 
section, we give the details of our extraction of PDFs with uncertainties at different as values, 
and examples of our recommended use of them will be given in Section [61 Here we just note that 
as we go away from the best-fit value of 05 (M|), the fit quality is automatically deteriorating, 
so that the PDFs cannot vary as much before the fit quality becomes unacceptable. It is an 
automatic result of our procedure that the PDF uncertainty shrinks as as{M'^) deviates from 
the preferred value determined by the global fit. 
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We provide eigenvector PDF sets with ^^(Ml) fixed at the hmits of the 68% and 90% 
C.L. uncertainty regions, given by Eqs. and (jl]). We also provide eigenvector PDF sets 
where as{M^) is fixed at half these limits. (Further intermediate values of are unnecessary, 
as we will explain later in Section [61) The eigenvector PDF sets are generated, for each 
fixed value of as, from input PDF parameters [1] 

ai(5^) = a° it^Cifc, (5) 

where a° are the best-fit input PDF parameters for that value of as, and the rescaled eigen- 
vectors are Cjfc = y/XkVik- The covariance (inverse Hessian) matrix has eigenvalues and 
orthonormal eigenvectors Vk (with components Vik)- The distance along each rescaled eigen- 
vector direction is adjusted to give the desired tolerance = (AXgiobai)^''^- determining the 
tolerance for each of the eigenvector PDF sets using Eq. ([2]), and the corresponding equation 
for the 68% C.L. uncertainties, we take Xno to be the values at the overall global minimum, 
that is, the values obtained using the best-fit value of as(M|). 

Consider the situation when ^^(Ml) is fixed at its upper 1-a limit in the NLO fit. Recall, 
from Fig. EJ that this limit is fixed by the BCDMS F^^ data set, for which Xn,o = 182.2 
for = 163 degrees of freedom. In this case we have ,^50 = 162.3 and i^es = 171.0 (while 
(^90 = 186.5); see Fig. 7 of Ref. [1]. For this data set we define the 68% C.L. region by the 
condition that 

Xl < (1^) ^68. (6) 

The rescaling factor (x^o/^so) is necessary to take account of the fact that the value of xio 
at the global minimum is quite far from the most probable value of ^50 ~ of this data set 
n. Indeed, we see that in this case the best-fit value xio — 182.2 lies outside the strict 68% 
C.L. region xl, < ^68 = 171.0. After applying the rescaling factor, the 68% C.L. region is given 
by xl - xl,o < 9.8. That is, the central fit for the BCDMS F^^ data, with asiM^) fixed at its 
upper 1-a limit, will have xl of 9.8 units worse than in the overall best fit. This can be clearly 
seen in Fig. M^a.) where we show xl ~ xlo when moving along eigenvector number 6. At zero 
distance, xl ~ xlfl = 9.8, indicated by the horizontal dashed line in Fig. [n](a). Moving in the 
negative direction along this eigenvector, the xl increases further, meaning that the tolerance 
for this eigenvector is zero in the negative direction. In the positive direction, the constraint 
which fixes the tolerance of 4.0 is provided by another data set, namely the NuTeV dimuon 
data, as seen in Fig. [9]^b). 

In Fig. M^c) we show the corresponding plot when as{M'^) is fixed at its lower 1-a limit in 
the NLO fit. Recall from Fig. [7] that this limit is fixed by the SLAC Fg^'' data, where xl is 3.4 
units worse than the best-fit value of x^o ~ 29.7. Here, the tolerance for eigenvector number 6 
is zero in the positive direction, while the constraint in the negative direction is again provided 
by the NuTeV dimuon data, this time giving a tolerance of 2.6, as seen in Fig. [Hl^d). 

The tolerance values for each of the eigenvectors, when as{M'^) is fixed at each of its 1-a 
limits in the NLO fit, are shown in Fig. [TOl The examples given for eigenvector number 6 were 
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Figure 9: Xn profiles in the NLO fit when moving along eigenvector 6 for (a) the BCDMS 
data when as{M^) is fixed at its upper 1-a limit, (b) the NuTeV dimuon data when as{M'^) 
is fixed at its upper 1-a limit, (c) the SLAC F^''' data when as{M'^) is fixed at its lower 1-a 
limit, and (d) the NuTeV dimuon data when as{Mz) is fixed at its lower 1-a hmit. 
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Figure 10: Tolerance values for each eigenvector when as{M^) is fixed to its (a) upper and 
(b) lower 1-a limits in the NLO fit. The text labels indicate the name of the data set which 
sets the tolerance constraint on each eigenvector direction. 
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typical. However, two anomalous situations are seen to arise. Firstly, when a5(M|) is fixed 
at its upper 1-a limit, then the Xn BCDMS data when moving along eigenvector 

numbers 1, 2 and 10 has a minimum (or is almost flat) at zero distance, so the tolerance is 
zero in both the positive and negative directions. Secondly, when 05 (M|) is fixed at its lower 
1-a limit, then the Xn ^^e SLAC Fg^"' data when moving along eigenvector numbers 14, 18 
and 20 has a maximum at zero distance. The tolerance is then non-zero in both the positive 
and negative directions, since the constraint is provided by data sets other than the SLAC 
^2^*^ data. Of course, in these two anomalous situations the minima/maxima in Xn "^ill 
occur at exactly zero distance, and this is an artifact of working in discrete distance units of 
i^XgiohaiY^'^i which is sufficiently accurate for our purposes. 

Comparing the tolerance plots in Fig. [10] with those in Fig. 10 of Ref. [1], we see that the 
PDF uncertainties will be much smaller (and more asymmetric) when 0:5 (M|) is fixed at each 
of its 1-0" limits, compared to when it is fixed at the best-fit value. This statement is confirmed 
in Fig. [TT] where we show the PDF uncertainties at = 10^ GeV^ for the cases where 05 (M|) 
is fixed at the best-fit value or shifted to each of its 1-a limits. Note that in all cases the 
uncertainty bands of the PDFs when as is at its 1-a limits are at most only slightly outside 
those for the PDFs when is at its best-fit value. 

There are a number of interesting features to note from Fig. [11] about the manner in which 
the central values of the PDFs change as a function of as- At high x the valence quarks (and the 
total up and down quark distributions) are anticorrelated with as- This occurs for two reasons. 
Firstly, the higher-order coefficient functions for structure functions in deep-inelastic scattering 
are positive at high x. Increasing as therefore means that we increase this contribution and 
hence require fewer quarks to fit the fixed-target structure function data at relatively low Q^. 
Secondly, the increased speed of evolution with larger as results in more migration from high 
X to lower x values. In absolute terms the effect is similar for up and down quarks, but the 
greater precision on up quarks means that the proportional effect is greater, and for x > 0.5 the 
central value of the default up quark distribution is outside the error bands of the distributions 
generated with as fixed at its 1-a limits. 

Another interesting feature, seen in Fig. [TlTf). is the confirmation of the anticorrelation 
between the small-x gluon and as- This is seen for x between 10~^ and 0.1 at = 10^ GeV^, 
and is a consequence of maintaining the fit quality to the small-x HERA data, i.e. the values 
of 9^2/(9 In asQ- From the momentum sum rule this results in a positive correlation of 
the high-x gluon and as- Note that there is some asymmetry in the deviation. We will return 
to this point in the next section. Since the quark distributions at small x are driven by the 
gluon, this change in the gluon affects the quarks. However, we note that there is in fact a 
slight correlation between the small-x quark distributions and as-, showing that the increase 
in evolution from the increased coupling slightly outweighs the effect of the decreased small-x 
gluon distribution. In Fig. [12] we show similar plots for the gluon distribution at = 10 GeV^ 
in both the NLO and NNLO fits. One can see that, in relative terms, the PDFs at the 1-a 
limits for as are further from the best-fit values at = 10 GeV^ than at = 10^ GeV^, 



24 



(a) 



0.94- 
10* 



Up valence at Q =10''GeV^ 



MSTW 2008 NLO (68% C.L.) 
Fix ttg at +68% C.L. limit 
Fix at - 68% C.L. iimit 




(b) 



Down valence at = 10" GeV^ 
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UpquarkatQ^ = 10''GeV^ 




Down quark at = 10" GeV^ 




Strange quark at Q = 10 GeV^ 




(f) 



GluonatQ =10"GeV' 




Figure 11: NLO parton distributions at = 10^ GeV^, including the 1-a PDF uncertainty 
bands, with as fixed at either the best-fit value or at each of its 1-a limits. 
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(a) 



GluonatQ^ = 10GeV^ 




(b) 



GluonatQ^ = 10GeV^ 




Figure 12: (a) NLO and (b) NNLO gluon distribution at = 10 GeV^ including the 1-a 
PDF uncertainty bands, with as fixed at either the best-fit value or at each of its 1-a limits. 

although the error bands always overlap. This just illustrates that DGLAP evolution drives 
PDFs together at asymptotic values of Q^. 



6 Implications for cross section calculations 

Each fixed value of as is associated with a central PDF set 5*0 and 2n eigenvector PDF sets S"^ 
defined by Eq. ([5]), where k = 1, . . . ,n and n = 20 (corresponding to 20 input PDF parameters). 
An observable PDF-dependent quantity F, such as a hadronic cross section, calculated using 
a particular value of as, has a central value F"s(S'o) and asymmetric PDF uncertainties given 
using the Hessian method by [1,5,70-72] 



J2 {max [ - F-s(5o), F»s[S,) - F-s(So), O] (7) 

k=l 



J2 {max [ F-s(So) - F-s(^+), F»s(So) - F»s(S~), O] }^ (8) 
fc=i 

for a fixed value of as- 

How should this prescription be generalised to calculate an overall "PDF+as" uncertainty 
on an observable F, i.e. accounting for the additional uncertainty on F due to the uncertainty 
on as'? Ideally, we would vary as continuously within its experimental uncertainty determined 
by the global fit. Each value of as would give a central value F"^{So) with PDF uncertainties 
given by Eqs. ([7]) and ([8]). The overall best-fit prediction is then F"s(5'o), where a° is the best- 
fit 05 value, and the overall "PDF+a5" uncertainties are given by the spread in the predictions, 
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including the PDF uncertainty, for each as value. More formally, the "PDF+a5" uncertainties 
are given by 

(AFpo,+„J+ = max({F"«(5o) + iAF,^)^}) - F"s(5o), (9) 
(AFp,,+,,)_ = F'^HSo) - min ({F°^(5o) - {AF.ll).}) , (10) 

as 

where the maximum and minimum are calculated when 05 is varied continuously within, for 
example, the range [a^ — la, + la] to determine the 1-a "PDF+as" uncertainties, and the 
PDF uncertainties are given by Eqs. (I7j) and ([H]) for each value of 0:5 in this range. 

First suppose that 0:5 is varied within a given range for the same central PDFs and their 
uncertainties and that F is a monotonic function of as (which is usually the case). Then the 
extreme values of the observable F would obviously occur when is at either of its limits. 
However, in practice, the PDF uncertainty decreases as 05 gets further from its best-fit value. 
Moreover, correlations between the relevant PDFs and as can enhance the as dependence of 
F, while anticorrelations will reduce it. Therefore, the extreme values of the observable F 
could, in principle, occur at any intermediate value within a given range of as- For reasons 
of economy, we provide PDF sets with uncertainties only for five different fixed values of as 
(i.e. the best-fit as, as fixed at the two limits, and as at half these two limits). In fact we 
find that in the majority of the processes we consider, the extreme values of the observable F 
come from as fixed at either the best-fit value or one of the two limits. Even in the rare cases 
where the extreme values of F come from as fixed at half the limit, omitting these intermediate 
as values would not significantly reduce the overall "PDF+as'" uncertainty. Hence we do not 
consider it necessary to provide PDF uncertainty sets at further intermediate as values. 

Since this prescription might seem quite complicated at first sight, we will give a few concrete 
examples of its application and consequences in the following subsections]^ 



6.1 W and Z total cross sections 

In Fig. [131(a) we show the PDF uncertainties on the Z total cross section at the Tevatron and 
LHC for each of the five sets with different fixed as{M'^) values. (The situation is similar for 
the cross sections.) The cross sections are calculated as described in Section 15 of Ref. [1], 
e.g. using the PDG 2008 [63] electroweak parameters. The increase of the Z cross section with 
increasing as{M^) is due to a combination of two effects. Firstly, there is the effect of the as 
dependence in the (positive) higher-order corrections to the partonic cross section. Secondly, 
there is the as dependence of the (predominantly) quark distributions. In Fig. [T3](b) we show 
the up quark distribution at = M|. The momentum fractions x = Mzl\fs (assuming 
= 0, i.e. LO kinematics) probed at the Tevatron and LHC at central rapidity [y = 0) are 
indicated. As previously noted, the small-x up quark distribution is slightly correlated with 

^We also provide a Fortran example program at Ref. [56]. 



27 



(a) 



(b) 



Z° cross sections with IVISTW 2008 NNLO PDFs 
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Figure 13: (a) PDF uncertainties on the Z total cross sections for each of the five PDF sets 
obtained from global fits performed with different fixed values of 05 (M|) around the best-fit 
value Aq;s'(M|) = 0. The results are shown as the percentage difference from the overall best-fit 
value. The horizontal dotted (dashed) lines indicate the PDF (PDF+q;5) 68% C.L. percentage 
uncertainty on az- The case is very similar, (b) NNLO up quark distribution at = M|. 
The values of x = Mz/y/s relevant for central production (assuming p| = 0) at the Tevatron 
and LHC are indicated. 
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the value of 05 (see also Fig. [TT] for other parton flavours). However, when integrating over 
rapidity to obtain the total cross section, the PDFs will also be sampled at larger (and smaller) 
values of x. From Fig. [T^ b). the up quark distribution is antzcorrelated with 05 in the large x 
region, which will be sampled more at the Tevatron than at the LHC, effectively cancelling out 
some of the correlation with as arising from the sampling of smaller x values. This explains 
why there is less dependence on at the Tevatron compared to the LHC in Fig. [T^a). 

We also indicate in Fig. [TST a) how the spread of the five individual predictions can be used 
to give an overall uncertainty, which is larger than that obtained when ^^(Ml) is fixed at the 
global best-fit value (as is usually done). Note that the extreme values for the Z cross section 
at the Tevatron arise when as(M|) is shifted to half its 1-a limit. In general, there exists a 
X G [0, 1] such that the extreme value of a given observable (or a PDF itself) occurs when 
as{M'^) is shifted by X-a from the best-fit value. Often, perhaps in most cases, X = 1, but in 
other cases X < 1, as we have here for Z production at the Tevatron. However, we see from 
Fig. fT3l(a) that the extreme values of the ilcr results are not far from the extreme values of 
the ±cr/2 results. 

Note that at the LHC there is an asymmetry in the extra uncertainty generated by allowing 
to vary, with more increase in the upwards direction than downwards. This is seen to be a 
consequence of a similar asymmetry in the PDF uncertainty of the up quark distribution when 
as is fixed at its upper limit, shown in Fig. [T3](b). with the PDF uncertainty in the relevant x 
region giving more freedom for upwards movement compared to the best fit than downwards. 
This is due to the fact that the HERA structure function data at small x would prefer a little 
more evolution than in the best global fit at NNLO. The HERA data therefore allow more 
freedom for extra evolution when increa=e= than for rednced evolution wlren decreasesS 

In Tables |2] and [3] we update the NNLO predictions for and Z production at the 
Tevatron and LHC that were given in Ref. [1], to allow for the enlarged uncertainty when the 
variation of as is taken into account l"1 For the NNLO total cross sections, we see that the 
combined "PDF+a^" uncertainty is about 2% at the Tevatron and 2.5% at the LHC. This 
is larger than the estimate of the theoretical uncertainty obtained from scale variation which 
gives below 1% [52]. However, at the LHC, the sensitivity to parton distributions at x values 
< 10~^ could lead to additional uncertainties which are difficult to estimate, but variations of 
PDFs from resummed fits [74] and contributions from small-x resummation to the Drell-Yan 
cross section [75], suggest that a few percent is quite possible. 

^''This effect is obscured slightly in the NLO quark distribution plots shown in Fig. [11] because the ±1ct limits 
for as(M^) itself, mainly determined by fixed-target data, are asymmetric at NLO and larger in the downwards 
direction, see Eq. fS]), but are practically symmetric at NNLO, see Eq. (|4|). 

"'^^In addition to the cross sections presented in Ref. [1], we give results for -y/s = 7 TeV at the LHC in light 
of the recently announced centre-of-mass energy at start-up [73] . 
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Table 2: Predictions for 1^ = + and Z total cross sections at the Tevatron and 
LHC, and their ratio Rwz, with PDF uncertainties only [1] and with the combined "PDF+q;5" 
uncertainty. The 68% C.L. uncertainties are given in all cases. We take ij>r — ij>f — Mw,z- 
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Table 3: Predictions for and total cross sections at the LHC, and their ratio 
with PDF uncertainties only [1] and with the combined "PDF+q;^" uncertainty. The 68% 
C.L. uncertainties are given in all cases. We take fJ^R = (j^f = Mw- 
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6.2 Higgs boson total cross sections 



In Fig. [MT a) we show a similar plot for Higgs boson production via gluon-gluon fusion with 
Mh = 120 GeV. The Higgs cross sections are calculated using the LO subprocess gg ^ H via 
a top-quark loop (with = 175 GeV). Higher-order corrections are included in the — oo 
limit up to NNLO, with only terms up to (1 — z)^ (where z = Mjj/s) included in the expansion 
of the "hard" pieces of the NNLO partonic cross section around the kinematic point z = 1 in 
powers of (1 — z) [76] (this approximation is accurate to better than 1% [76]). 

At LO, the PDF dependence of the Higgs cross section is given simply by the effective 
gluon-gluon luminosity, defined as 

^ = \f^ /.(^i' O = ^h), r= ^, (11) 

OlVl^ S Xi s 

while the as dependence of the partonic cross section is given by an overall a| factor. In 
Fig. [TW a) we show the percentage difference of the trivial a| factor and the gluon-gluon 
luminosity, with respect to the values for the best-fit as-, for each of the five PDF sets. (The 
percentage PDF uncertainty on the gg luminosity for each of the five PDF sets, not shown in 
Fig. [T^a). is very similar to the percentage PDF uncertainty on the NNLO total cross section.) 
The dependence of the gg luminosity on as can be understood by looking at Fig. [TW b). where 
we show the relevant gluon distribution as a function of momentum fraction x. The momentum 
fractions probed at the Tevatron and LHC at central rapidity (y = 0), assuming LO kinematics 
where = and Xi = X2 = Mh / a/s, are indicated. At the Tevatron, y = corresponds almost 
exactly to the crossing point of the three gluon distributions shown in Fig. [T^b). However, 
integrating over xi (or equivalently, rapidity y) in Eq. (fTTj) leads to some contribution from 
larger x values, leading to a slight overall correlation between the gg luminosity and 05 at 
the Tevatron. At the LHC, we see from Fig. [TWb) that y = corresponds to the point of 
almost maximal anticorrelation between the gluon distribution and 0:5. Again, there will be 
contributions from both larger and smaller x values in the integral over xi in Eq. ffTTj) . but there 
is still a substantial anticorrelation between the gg luminosity and as shown in Fig. [TW a). At 
LO, for each fixed as value, the percentage difference of the Higgs total cross section, with 
respect to the value obtained with the best-fit a^, would be approximately given by simply 
adding the percentage differences of the a\ factor and the gg luminosity. However, higher-order 
corrections will significantly increase the as dependence of the partonic cross section beyond 
the LO a\ factor. We see from Fig. [MT a) that at the Tevatron both higher-order corrections 
and the correlation of the gg luminosity with as increase the as dependence of the Higgs total 
cross section. On the other hand, at the LHC, higher-order corrections compensate almost 
exactly for the anticorrelation of the gg luminosity with as-, meaning that the as dependence 
of the total Higgs cross section is surprisingly almost the same as the trivial a| factor. 

In Fig. [TUfa) we also show that the "PDF+a^" uncertainty, indicated by the dashed lines, 
is much enhanced compared to the "PDF only" uncertainty indicated by the dotted lines. Note 
that there is an even more marked asymmetry in the as dependence of the total uncertainty 
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Figure 14: (a) PDF uncertainties on the Higgs total cross sections {Mh = 120 GeV) for each 
of the five PDF sets with different fixed values of as{M'^). The results are shown as the 
percentage difference from the overall best-fit value. We also indicate the as dependence of 
the trivial q;| factor and the gg luminosity (by open squares and triangles respectively, slightly 
offset for clarity), again shown as the percentage difference from the best-fit value, (b) NNLO 
gluon distribution at = Mfj = (120 GeV)^. The values of x = Mh/\/s relevant for central 
production (assuming pff = 0) at the Tevatron and LHC are indicated. 
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for Higgs production than for Z production, with much more of an increase in the upwards 
direction. This is due to the same source. The HERA data demand more of an increase in the 
gluon (with tighter bands) when 0:5 decreases than a decrease in the gluon when as increases; 
see Fig. [TWb). This asymmetry would be absent to first order if the default fit to the HERA 
structure function data was perfect. 

In Fig. [15] we show, as a function of the Higgs mass, how the uncertainty increases when 
variation of as is included. Both uncertainty bands increase in size for increasing Higgs mass at 
the Tevatron as we become more sensitive to the less well-determined high-x gluon distribution. 
At the LHC, the size of the uncertainties is largely independent of the Higgs mass in the range 
considered here, because we are always dominated by the gluon in the region of x ~ 10~^. 
The total "PDF+a^" uncertainty can become comparable to the theory uncertainty estimated 
from scale variation (most recently updated in Refs. [77,78]) of ~ ±10% for high-mass Higgs 
production at the Tevatron, while the uncertainty from unknown higher-order QCD corrections 
dominates at the LHC where the combined "PDF+as" uncertainty is still relatively small. 

6.3 Inclusive jet production 

In Fig. flGT a) we show the "PDF+a^" uncertainty compared to the "PDF only" uncertainty as 
a function of for inclusive jet production in a central rapidity region at the Tevatron and 
LHC. The inclusive jet cross sections are calculated using the fastnlo package [54], based 
on nlojetH — h [79,80], with the fcr jet algorithm. At the Tevatron we show the data-to- 
theory ratio for the CDF Run II data [42], where the error bars represent only the statistical 
uncertainties on the data, and the data points have been shifted by the correlated systematic 
uncertainties determined by the best-fit MSTW 2008 NLO PDFs; see Eq. (38) and Fig. 48 of 
Ref. [1]. Note that the systematic uncertainties on the Tevatron data on inclusive jet production 
are generally very much larger than the statistical uncertainties. The x value probed in the 
PDFs is approximately xt = 1'PTl\fsi and hence the Tevatron and LHC plots show a similar 
trend if comparing the same xt rather than the same pr- The fractional contributions of the gg- 
, gq- and gg-initiated processes to the cross section are shown in Fig. [T6](b). Here, gq implicitly 
includes qg contributions, and q stands for both quarks and antiquarks. At smaller values of 
the additional uncertainty on leads to an increase in the overall uncertainty which is 
due to the largely gluon-initiated cross section being correlated (or at worst uncorrelated) with 
as- At larger values of pr, where the x probed > 0.3-0.4, the cross section is dominated by qq 
scattering. As seen in Section [5], the high-x quarks are highly anticorrelated with as-, and hence 
the uncertainty on 05 does not lead to a further enhancement of the PDF uncertainty. In the 
transition region of intermediate pr, the upper "PDF+a5" uncertainty is enhanced very slightly 
by the presence of the PDF sets with as shifted upwards by cr/2, but in all other regions the 
extreme values are provided by either the best-fit as or the shifts of as by ila. The situation 
is similar in other rapidity bins. The total "PDE+a^" uncertainty is smaller than the theory 
uncertainty at NLO estimated from scale variation, which can be ~ ±10%. However, the PDF 
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Higgs cross sections with IVISTW 2008 NNLO PDFs 

Tevatron.Vs = 1 .96 TeV LHC.Vs = 14 TeV 
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Figure 15: (a) Higgs total cross sections as a function of the Higgs mass at the Tevatron and 
LHC (b) Percentage uncertainty in the Higgs total cross sections when accounting simultane- 
ously for PDF and as uncertainties (outer error bands) as compared to that due to the PDF 
uncertainty alone (inner error bands). 
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Inclusive jet cross sections with MSTW 2008 NLO PDFs 
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Figure 16: (a) "PDF+as" uncertainty compared to the "PDF only" uncertainty as a function 
of pt for inclusive jet production in a central rapidity region at the Tevatron and LHC At 
the Tevatron we also show the CDF Run II data points [42] (statistical uncertainties only), 
(b) Fractional contributions of the gg-^ gq- and gg-initiated processes as a function of px- 
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uncertainty tends to allow more variation in shape as a function of pt, whereas scale variation 
affects mainly the normalisation (for low rapidities). 



7 Conclusions 

Parton distribution functions (PDFs) must be used together with the appropriate value of 
as, since both the input PDFs and as{M'^) are determined simultaneously from global fits 
to deep-inelastic and related hard-scattering data within the framework of leading-twist fixed- 
order coUinear factorisation (in the MS scheme). In a previous paper [1] we determined the 
best-fit values of as{M'^) at leading-order (LO), next-to-lcading order (NLO) and next-to-next- 
to- leading order (NNLO). In this paper we have determined the experimental uncertainties on 
q;5(M|), finding that at 

NLO: a5(M|) = 0.1202 l^ois (68% C.L.) ^Hol (90% C.L.), (12) 
NNLO: a5(M|)= 0.1171 ^^oil (68% C.L.) ^^031(90% C.L.). (13) 

The experimental errors on quoted here were obtained using an extension of the "dynamic 
tolerance" method [1], which gives a much refined revision on the previous estimate of ±0.002 
from the MRST 2001 analysis [47] obtained using a fixed A^gj^j^^j = 20. We did not address in 
detail the important issue of the additional theory uncertainty on our as determination, but 
an estimate was given of ±0.003 at NLO and at most ±0.002 at NNLO. 

In the process of determining the experimental uncertainty on a5(M|) we performed global 
fits with different fixed values of asiM^) in steps of 0.001 for a range of 0.110-0.130 at NLO 
and 0.107-0.127 at NNLO. Public grids for these PDF sets are available from Ref. [56]. These 
PDF sets will be invaluable for as determinations by other groups. 

We then considered, for the first time, the correlation between PDF uncertainties and 
uncertainties on as{M^). By an extension of the "dynamic tolerance" method [1] we showed 
how to consistently account for the effect of both these sources of uncertainty in cross section 
calculations. When «5(M|) is fixed at either of its 1-a limits, then the tolerance for each 
eigenvector PDF set will be one-sided, i.e. zero in one direction and non-zero in the other 
direction. Therefore, the PDF uncertainties will be much smaller (and more asymmetric) when 
0:5 (M|) is fixed at cither of its 1-a limits than when as{M'^) is fixed at its best-fit value. We also 
provide "in-between" PDF sets with as{M^) fixed at half the 1-a limits, where the size of the 
PDF uncertainty is generally smaller than when as{M^) is at its best-fit value, but larger than 
when as{Mz) is at its 1-a hmits. The PDF uncertainty for physical observables, such as cross 
sections, should be calculated separately for each of the five sets (each comprising the best-fit 
PDF set and 40 eigenvector PDF sets) with a5(M|) displaced by {-la, -a/2, 0, +a/2, +la} 
from its best- fit value. Then the combined "PDF+a^" uncertainty is given by the envelope 
of these five predictions. We also provide similar 90% confidence-level (C.L.) PDF sets, where 
as{Mz) is displaced to each of its 90% C.L. limits, and to half these limits. Public grids for 
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all these PDF sets, for use either with the standalone MSTW interpolation code or via the 
LHAPDF interface [81] (from version 5.7.0), are available from Ref. [56]. 

As examples, we calculated the total cross sections for production oiW, Z and Higgs bosons 
at the Tevatron and LHC. For W and Z production, where the LO subprocess is 0{ag) and 
is quark-initiated, there is not a significant enhancement due to the combined "PDF+ag" 
uncertainty as compared to the PDF-only uncertainty with a fixed as- However, the additional 
uncertainty due to as is more important for Higgs boson production via gluon-gluon fusion, 
where the LO subprocess is This is particularly the case at the LHC, where the PDF 

uncertainty (for fixed 0:5) is very small. Finally, we considered the combined 'TDF-l-as" 
uncertainty on the cross sections for inclusive jet production at the Tevatron and LHC as a 
function of pr- The additional uncertainty from as enhances the PDF-only uncertainty at low 
Pt, where the largely gluon-initiated cross section is correlated with as, but not at high pt, 
where the largely quark-initiated cross section is anticorrelated with as- Having discussed these 
few basic processes, the general implications for the "PDF-l-as" uncertainty on new physics 
production (for example, Z' or supersymmetric particles) should be clear by considerations 
such as powers of 0:5, the relevant initiating partons and the approximate x values. For any 
particular process the details of the uncertainty can be explicitly calculated in a straightforward 
way using the tools we have provided in this paper. 
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A Appendix: Definition of a^(Q^) 



There is more than one definition of the strong couphng as(Q^) beyond LO commonly used 
in QCD phenomenology. The various prescriptions are all formally equivalent since they differ 
only at higher orders. The strong coupling asiQ"^) runs according to the renormalisation group 
equation (RGE): 



An) 



dlnQ2 

where the /9-function coefficients up to NNLO are 
The form of as{Q^) used in all previous MRST fits is given by 
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where as{Q'^ ,nf) is defined as the solution of the RGE, Eq. ( lA.ll) . which can be rewritten as 

da 



—a —ha — ca — 



(A.4) 



dlng2 

where a = j3o{nf)as{Q'^ ,nf)/ (Att), b = Piiji j) / [3l{n j) and c = I32{nf) / (3l{nf). The solution of 
this equation in terms of an input parameter A is [82] 



ln(g)=U-Mn(i + 6) 
^ ^ U-4&tan-i 



b+2ac 
Vic-b'^ 



-6 In 



LO 

NLO 

NNLO 



(A.5) 



Note that asiQ"^) defined by Eq. (]A.3P is continuous across the fiavour thresholds, and that the 
fitted parameter A used in solving Eq. (]A.5P is independent of the number of active fiavours 
Uf. This is different from other common prescriptions where it is necessary to choose A to 
be rij-dependent to ensure the continuity of 05 (Q^). The MRST prescription, Eq. (1A.3I) . for 
asiQ"^) is numerically very similar at NLO to the other two principal definitions in use, provided 
that the same input value as{M'^) is taken [8]. However, the value of A determined using one 
prescription for as'(Q^) should not be used in a different prescription. For example, the MRST 
relation between A and as{Q'^) differs from the PDG relation [63], which also differs from the 
CTEQ relation [2,8]. 

By differentiating Eq. flA.3l) it can be shown that asiQ"^) at LO satisfies the RGE exactly, 
and at NLO there is an additional higher-order term of 0{ag), which is sufficiently small to 
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be beyond the desired accuracy. Therefore, the MRST definition Eq. (1A.3P is formally valid at 
LO and NLO, but not at NNLO. In addition, at NNLO, the strong coupling in the MS scheme 
is discontinuous at the heavy fiavour thresholds, = m'jj, where ttlh is the pole mass of the 
heavy quarks, i.e. 

asiml, + 1) = n,) + ^ (^^^^^|^) ' . (A.6) 

These discontinuities in as were not taken into account in any of the MRST NNLO fits, 
including the MRST 2006 NNLO analysis [83] where the corresponding discontinuities in the 
PDFs were included for the first time. (It is theoretically possible to modify Eq. (]A.3P to be 
correct up to NNLO, but only via a much more cumbersome and unattractive relationship.) 
Moreover, rather than use the solution of Eq. (lA.SP directly, the MRST NNLO fits used a 
parameterisation (called qwikalf) of the solution as a degree-5 polynomial in y^\n(Q'^ / A), 
with the coefficients fitted. 

We have changed the definition of as in the recent MSTW analyses [1] to match the def- 
inition used in public evolution codes such as PEGASUS [84] and hoppet [85], that is, we use 
the exact solution of Eq. (lA.ip with fiavour matching using Eq. (1A.6P at NNLO. (Both the 
MRST and MSTW definitions differ from the truncated form used by CTEQ [2,8] and also by 
the PDG [63].) This change of as definition allowed our evolution code to be checked against 
the results from these two independent public evolution codes for the first time [1]. Here, for 
completeness, we numerically compare the MRST definition of a 5 with the MSTW definition, 
taking the same input values of as at 1 GeV, 2 GeV, 10 GeV and Mz- The input values 
at these scales are taken from the best-fit MSTW 2008 analyses [1] at the respective order, 
and the heavy fiavour thresholds are set to = 1.40 GeV and rrih = 4.75 GeV, evolving as 
with a maximum of five fiavours. In Fig. [T7] we show the ratio a|^^^"'"/Q;^^"'"^ as a function 
of the renormalisation scale = Q, using, in turn, each of the four input scales. Note that 
Q,MRST _ Q,MSTW Q^[y fQj^ fQ-^J^J. flavours if the input scale is also taken in the four-fiavour re- 
gion. Otherwise, there are non-negligible differences. The parameterisation qwikalf used in 
the MRST NNLO fits leads to sizable discrepancies compared to the exact result given by the 
solution of Eq. flA.5p . We see that if the MRST and MSTW analyses are forced to have the 
same value of in the region containing the most data {Q^ ~ 20 GeV^) then by = M| a 
discrepancy of more than 0.5% can occur. 
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